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Abstract 

We prove part of a conjecture by Johansson, Kahn and Vu [6] re- 
garding threshold functions for the existence of an //-factor in a ran- 
dom graph G{n,p). We prove that the conjectured threshold function 
is correct for any graph H which is not covered by its densest sub- 
graphs. We also demonstrate that the main result of |6] generalises to 
multigraphs, digraphs, and a multipartite model. 



Keywords: Random Graphs, Factors, Digraphs 
> 

O 

O ; 1 Introduction 

en 

^ ■ We will properly state our theorems later in the introduction, after introduc- 

er^ . ing necessary notation and background. However, for readers already familiar 

with the background or willing to momentarily gloss over the details, let us 
immediately sketch our main results and methods. In a recent breakthrough 
k> ■ (winning a 2012 Fulkerson Prize), Johansson, Kahn, and Vu [B] determined 

H ' the threshold for a random graph G to be factorable by a strictly balanced 

- ■ ■ fixed graph H, and they conjectured the threshold for every H. Our main 

result. Theorem II. ![ establishes their conjecture for 'non-vertex-balanced' 
graphs H, a class of graphs disjoint from strictly balanced ones. The pos- 
itive side is the difficult one, and the main idea of the proof is, to cover a 
fraction of G with copies of a densest subgraph of H, then contract that 
subgraph to a point, extend the cover, and repeat. However, after the first 
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step, two things have changed: the graph H may have become a multigraph 
and, more significantly, we have committed to correspondences between some 
vertices of G and H. We manage these difficulties through Theorem II ■2[ as- 
serting that, if the vertices of the random graph G are partitioned into classes 
corresponding to vertices of H, then G almost surely has an ilf -factor which 
respects the partitioning. Our proof follows the steps of the proof in [6j ; it 
is not especially inventive, but neither is it easy. Krivelevich [S] needed a 
special case and verified it, but [8] does not include the proof. The result is 
clearly useful, and it is therefore worth writing down the proof details. We 
prove a similar result for directed graphs. 

Formally, for graphs H and G, an H -factor of G is a collection of vertex- 
disjoint copies of iJ in G that form a vertex cover of G. Clearly G can only 
contain an if -factor if |\^(if)| divides |l^(G)|. We are mainly interested in 
random graphs on n vertices and and we assume throughout the paper that 
\V{H)\ divides n. 

We call a function /(n) a threshold for a graph property K if, for an Erdos- 
Renyi random graph G{n,p), (that is, the graph on n vertices where each 
edge is present with probability p independently of the absence or presence 
of any other edge) 

Pr(G(„.p) safsfies /f) ^ (^ ';^!"> = '^<{<">'' ^"^ 

[0 iip{n) = o{f{n)). 

Since containing an if -factor is an increasing property (that is, adding 
edges does not destroy any ii-factor) it is well known that a threshold func- 
tion exists, see for example [5]. Note that a threshold is unique up to multi- 
plicative positive constants so we will use notation and with slight abuse of 
language we will speak of "the" threshold. The study of thresholds for vari- 
ous classes of graphs H has attracted considerable interest. The distinctions 
center around density properties of H. We define the density of a graph H 
on at least two vertices, as 



\V(H)\-1 

Let m(H) be the maximum density of any subgraph of H, that is, 

m{H) = max {d{H') : H' C H, \H'\ > 2} . 

A graph H is called balanced if m{H) = d{H), i.e., if no subgraph of H has 
density greater than that of H, and strictly balanced if every proper subgraph 
of H has density smaller than that of H. 



For any vertex v of if, define the local density at v to be the maximum 
density restricted to subgraphs containing f , 

m{v, H) = max {d{H') : H' C H, \H'\ > 2, w G V{H')} . 

A graph H is vertex balanced if, for all v & H, m{v, H) = in{H). 

Note that if H is balanced then it is vertex balanced: a densest subgraph 
oi H is H itself, so m{v, H) and m{H) are both given by H' = H, for 
m{v,H) = m{H) = d{H). Taking the contrapositive, if H is non-vertex 
balanced then it is not balanced, and not strictly balanced. Graphs may 
thus be partitioned into those that are non-vertex balanced, those that are 
strictly balanced, and the rest (those that are vertex balanced but not strictly 
balanced). An example of a non- vertex balanced graph is shown below. 
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The thresholds for H-iactors for various fixed graph H have been of inter- 
est for a long time. The case H = K2 is simply the threshold for G to have a 
perfect matching which has been known since 1966 [3] , see also [2j for a more 
precise result. The next ii- factor threshold result was for trees by Luczak 
and Rucinski [9]. Note that matchings and trees are vertex-balanced. For 
sub-classes of non-vertex-balanced graphs, the threshold is known for graphs 
H whose minimum degree is less than m{H) [ll|5]. In 2008 the seminal paper 
by Johansson, Kahn, and Vu [6] determined the threshold for all strictly bal- 
anced graphs (also resolving the so-called 'Shamir's problem' on hypergraph 
matchings). The special case of finding the threshold of an if -factor for the 
strictly balanced graph H = K3 had been described by Janson, Luczak and 
Rucinski as one of the two 'most challenging, unsolved problems in the the- 
ory of random structures' ^ p. 96] and was first posed by Rucinski in 1992 
[TT] (the second problem was 'Shamir's problem'). 

In their paper Johansson, Kahn and Vu conjecture thresholds for all 
graphs H, depending on whether H is vertex-balanced or not. We restate this 
formally as Conjecture [1] in Section [2j Our first main result establishes this 
conjecture for all graphs in the second category. More precisely, let th.H{n) 
be the threshold function for G{n,p) to contain an if -factor. We prove the 
following. 

Theorem 1.1. If H is non-vertex-balanced, 



The main idea of the proof is, first, to embed the dense subgraphs of if, 
giving a 'partial factor' covering a corresponding proportion of the vertices of 
G{n,p). We then collapse each such subgraph of ii to a single vertex, giving 
a less dense strictly balanced graph (or possibly multigraph). Finally, we 
extend the partial factor to a full factor using Theorem 11.21 a generalisation 
to partitioned multigraphs of the strictly balanced result of [6j . 

To state Theorem 11.21 we need some more notation. Let en = \E{H)\, 
vh = \V{H)\ and V{H) = {xi,X2, ■ ■ ■ ,x^j^}. Define the r-fold blowup 
B{H, r) of H as an wjif-partite graph with parts Vi, V2, . . . , K^^, each of size 
r, with an edge between Vi E Vi and Vj e Vj iff there is an edge between 
Xi and Xj in H. In a slight abuse of notation, let H{n,p) be the random 
subgraph of B{H, n/vu) obtained by retaining each edge with probability p. 
Likewise, given a multigraph T-L, we define the random multigraph ^{n^p): 
the blowup BiTiju/vy) has as many edges between Vi G Vi and Vj G Vj as 
there are edges between Xi and Xj in "H, and again l-Lin^p) is the random 
subgraph oi BiTi, n/vy^) obtained by retaining each edge with probability p. 

The setup suggests looking for a 'restricted' "H-factor of Ti^rijp) where, 
for each copy of "H, each vertex belongs to the corresponding part of Ti^rijp). 
We will not in fact assume this, but the following theorem shows that below 
some threshold there is no factor, while above the threshold there is a factor 
of the restricted form. 

Theorem 1.2. Fix a multigraph Tifwhich may be a sim,ple graph H). IfH 
is strictly balanced, then the threshold for 'H{n,p) to contain an Ti-factor is 

thnin) = (n-V-(«)(logn)i/l^(«)l) , 

while ifH is not strictly balanced, the threshold satisfies 

th«(n) = 0(n-i/"^(^)+''«). 

In both cases, above the threshold there is w.h.p. an T-L-f actor in which, in 
every copy ofH, each vertex is in the corresponding part ofH{n,p). 

We note that there is a key difference between the partitioned and usual 
G{n,p) thresholds for non-strictly balanced graphs. In G{n,p), we show that 
for non- vertex balanced H, the o(l) term can be completely eliminated, while 
it remains in the form of a log term for strictly balanced H. In the partitioned 
random graph however, there will always be a log term. This can be seen by 
considering the graph consisting of a triangle and a single isolated vertex. 
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In G{n,p), this graph is easy to embed, as it is equivalent to a partial 
factor of triangles, taken over the whole graph, but only covering 3/4 of the 
vertices. At this point, the remaining spare vertices immediately complete 
the factor. In the partitioned case, by fixing the position of these triangles, we 
are implying the existence of a full triangle factor over those corresponding 
partitions, and as such, by the result for strictly balanced graphs, we will 
require a log term, corresponding to the densest subgraphs. 

Lastly we prove that the threshold for digraph factors coincides with 
that for graphs which is an easy consequence of Theorem 11.21 We define 
the random directed graph D{n,p) with vertex set V of size n, such that 
for each pair of vertices u and v in V, there is an edge between them with 
probabihty p independently of all other edges, and each such edge is either 
{u,v) or {v,u), with probability half each. We can prove the threshold for 
both strictly balanced and non- vertex-balanced digraphs, in D{n,p). Note 
that for strictly balanced graphs we can also prove the partitioned form, i.e., 
the digraph form of Theorem 11.21 also holds. 

Theorem 1.3. Fix a digraph H. If H is strictly balanced, then the threshold 
function thnin), for the random directed graph D{n,p) to contain an H- 
factor is 

i\iH{n) = e (n-i/"(^)(logn)i/^«) , 

while if H is non-vertex-balanced, 

thnin) = (n-i/'"(-f^)) . 

2 Preliminaries 

We first note the following known results which we will need later. 

Theorem 2.1. (Rucinski JTlf) Let H be a graph with at least one edge, and 
Fff^e, n) be threshold function for the property that G{n,p) contains a partial 
H-factor covering all but at most en vertices. Then for any fixed e > the 
threshold function satisfies, 

FH(£,n) = e(n-^/'"(^)). 

Theorem 2.2. (Alon, Yuster fJ]/) Let H be a graph with minimum degree 
S{H), satisfying S{H) < m{H). Then 



In their respective papers, stronger results than what is stated above are 
actually proved, but we use threshold notation for consistency. 

The following two results can be found as Theorem 2.1 and 2.2 in [6]. This 
paper's aim is to provide a generalisation of the first result, which allows for 
improved bounds on the second. 

Theorem 2.3. JE^ Let H be a strictly balanced graph with ch edges. Then 
the threshold function, thnin) for G{n,p) to contain an H-factor satisfies 

Theorem 2.4. J^ For H , an arbitrary fixed graph, the threshold function, 
th-uiji) for Gin., p) to contain a H-factor satisfies 



th^H = 0{n 



-l/m{H)+o(l)~ 



In [6], the authors define threshold functions th^ (n) and th\j{n)^ for a 
given fixed graph H. Firstly, t\i\j{n) is defined as the threshold for every 

[21 

vertex in G to be covered by at least one copy of H, while ih.\j{n) is the 
threshold for the property of satisfying the following two conditions: 

1. every vertex of G is covered by at least one copy of if, and 

2. for each x G V{H), there are at least u/vh vertices x' of G for which 
some isomorphism of H takes x to x'. 

This threshold is clearly a lower bound for the threshold for finding a factor 
th.H{n) and the paper conjectures that they are, in fact, equal, and they prove 
this for strictly balanced H. In this paper, we will show this conjecture also 
holds for a large class of non-strictly balanced graphs. 

The threshold th^J {n) is completely determined and stated without proof 
in [5], for all graphs. For completeness we have included a proof below. Let 
s^ = mm{e{H') : H' C H,v e V{H'),d{H') = m{v,H)} and let s be the 
maximum over all s„. Clearly m{v, H) < m{G) for all v, with equality for at 
least one v. We are now ready to state and prove the following: 

Lemma 2.5. If for all v G V{H), m{v,H) = m{H) then 
thgl(n) = (n-i/™(^)(log(n))^/^) . 



Otherwise 



thgiH = e(n-i/'"(^)) 



Proof. We clearly have 4 cases to consider. Firstly we will look at vertex- 
balanced graphs, i.e. those that satisfy, m{v.,H) = m{H) for all v G V{H). 
Condition 1 of th^ (n) , namely that each vertex of G is covered by at least 
one copy of if, is well studied and exact thresholds can be found as Theorem 
3.22 in |5j and follow from results proved by Spencer in [13], and, in this case 
they are equal to our required bound. 

Now we simply have to prove that condition 2 is also satisfied for p = 
uj{n~^/^^^^{\og{n)y^'^). We note that we clearly have that Cp = uin'^/"^^^^) 
for any constant C . 

Let V{H) = {1,2, .. . ,vh}, our result will follow from partitioning the 
edge set of G{n,p) into the union of random graphs Go, Gi, . . . , G„^ with 
edge probability p' , where l—p = ni'=o(-'- ~P') ~ (-'- —p'Y"^^ and repeatedly 
applying Theorem 12.11 to find partial factors of H. We first apply it in G*0) 
which has edge probability p' > p/{vh + ^) = uj (^n~^^"^^^^^ , which is sufficient 
to applv [2?T] with e = 1/4. This gives us a partial if-factor covering 3n/4 of 
the vertices of G{n,p) and hence (1 — e)n/vH = 3ra/(4t>/f) vertices of G{n,p) 
are covered by each vertex of H with high probability. 

For each i = {1,2, . . . ,vh} we consider vertex i oi H and the vertices of 
Gq, that we have already covered by copies of i, and then, the random graph 
induced by the edges of Gi on the vertex set of Go, without those already 
covered vertices. This leaves us with a set of n' = (1 — 3/{4vH))n vertices in 
each Gi, that have not already been covered by a copy of the vertex i of H, 
with an independent random edge set. We can consider this as equivalent 
to the random graph G{n',p'), where p' > p/{vh + 1) = w {{n')~^^^'^^^^ 
(assuming \vh\ > 2). This allows us to again, apply Theorem 12.11 to find 
another set of partial factors on 3/4's of the remaining vertices, giving us in 
total {6/4:Vh — 9/16vjj)n > n/vn, for vh > 1, vertices covered by vertex i of 
H as required. 

We now consider graphs that are non-vertex-balanced and so do not sat- 
isfy m{v, H) = m{H) for all v G V{H). As before, the threshold for covering 
is known, and is in fact lower than our required threshold here. 

The same argument for proving condition 2 as above applies since we only 
required p = uj{n~^^'^^^^), so it follows that, for these graphs, both conditions 
are satisfied for p = uj{n~^^'^^^^). It only remains to show that the threshold 
is not lower than this for such H. This follows from another result, proved 
by Rucihski and Vince [12]. They prove, that for any vertex of G{n,p), the 
threshold for it being covered by a particular vertex f of if is n~^^^^'"'^\ 

With the result above in mind, we define the following; for a vertex 
vg G V{G{n,p)), we let X^^ be the indicator variable for vq being covered 
by a copy of v, where v G V{H) satisfies m{v, H) = m{H), namely X^^ = 



if it is not covered, and X^^ = 1 if it is. Suppose that condition 2 is satisfied 
with high probabihty. Therefore we have that 

E( ^ X,^)= Yl H^-c) > ^/^H. 

Suppose that p = o{n~^^"^^'"'^^). We know that for p in this range, X^^ = 
0, with high probabihty, and therefore E(X„g) = o(l). Since there are only n 
such choices of vq, we have a contradiction. Therefore the threshold th^ (n) 
is not o{n~^^"^^^^), and so must be n~^^"^^^\ as required. D 

In |6], it is conjectured that thjj(n) = thnin), so in light of the above, 
this can be restated as the following 

Conjecture 1. |^ -(f/o?^ all v E V{H), m{v,H) = m{H), then 
Otherwise 



th.H(.n) = n 



-l/m{H) 



3 Theorem 11.1 



The first case of Conjecture [T] has been proved for strictly balanced H, and 
now we will prove the second statement in its entirety, namely we prove that 
the threshold for containing an if -factor is th.H{n) = th^ (ra) = n'^'^/"^^^^ for 
graphs where m{v,H) < m{H) for some v G V{H). We begin by demon- 
strating that the result follows, assuming Theorem II. 2^ and then in Section 
m we return to prove it. 

In general terms, the main idea of this paper, is to 'collapse' dense sub- 
graphs of H to get a new graph (or possibly multigraph) "H, which we will 
formally define later. 



n 



Since, we have m{v,H) < m{H) for some v, we know that at least one 
vertex of H does not belong to any dense subgraphs of H. As a result, we 
will only need to cover a linear fraction of the vertices of G{n,p) with these 
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dense subgraphs, since our factor will contain at least n/vu vertices to be 
covered by copies of these less dense vertices. 

Once we have embedded the dense sub-graphs, we then use Theorem 
ll.2[ treating these (collapsed in H) embedded graphs as single vertices and 
finding a new, equally or less dense, "H-factor on these collapsed vertices, 
along with the remaining uncovered vertices of G{n,p). This will translate 
to the required factor in our original graph. 

To do this for the graph above, we would simply require a generalisation 
that allows us to partition our vertices and choose which vertex of T-L will 
'cover' the vertices of G{n,p) in our factor. However, in a more general case, 
after collapsing vertices in H we may no longer be left with a graph, but a 
multigraph, hence the required level of generalisation to use this method. 



^^ 




In this example, the densest subgraph is clearly the K^, so we would 
collapse this to a single vertex. However, one vertex of H contains edges to 
two vertices of this subgraph, leaving us with a multigraph. It is also worth 
noting that the density of K^ is 2.5 and since every vertex has degree at least 
3, this is an example of a non-balanced graph, that could not be solved by the 
minimum degree result fl] , and where we can provide the optimal threshold, 
improved on that provided by Theorem 12.41 

To define our collapsing method formally, we begin with some observa- 
tions on the effects of vertex collapsing on the density of H. We know that 
m{v, H) = m{H) for some vertices v, and these are the vertices that we col- 
lapse. For each such v, we, in turn, choose a subgraph H' such that v E H' 
and d{H') = m{H). We now collapse all the vertices in H' into a single 
vertex. Giving us a new (possibly multi) graph, which we will call Hi, which 
has the vertices of H\H', with an additional vertex vi, and an edge for each 
edge of H with an endpoint in H\H'. We continue this process, going from 
Hi to Hi^i, at each stage, collapsing a subgraph of density m{H) until none 
remain. The final graph which contains no subgraphs of density m{H), we 
will call T-L. We prove the rigour of this statement in the following lemma. 

Lemma 3.1. The collapsing process, described above, terminates after a fi- 
nite number of steps, producing a unique multigraph Ti, with m{'H) < m{H). 



Proof. Firstly note that the density of Hi, defined in the same way for multi- 
graphs as for graphs, is 

^^^^)_ eiHi) e{H)-e{H') e{H) - e{H') 



{v{Hi) - 1) v{H) - v{H') + 1-1 {v{H) - 1) - {v{H') - 1) ' 

(1) 
Noting that d{H) < ra{H) = d{H'), we can see that the above gives us, 

d{Hi) < d{H) < m{H). If instead of H and Hi, we consider any subgraph of 

H containing the vertices we are going to collapse and the resulting subgraph 

of Hi, the same inequality shows that we have not created any subgraph in Hi 

of density greater than m{H). In fact, considering the following for positive 

numbers a, b, c and d; 

a — c a a c 

b ~ d ~ b b ~ d 

(assuming b > d), and noting that we only have equality on one side if we 
have it on both, it follows from ([T]) that any vertex that is in a subgraph of 
density m{H) in -ffj+i, must have also been in such a graph in Hi. 

Since we are considering H such that, at least one vertex v, satisfies 
m{v,H) < m{H), the above shows that the collapsing process will never 
produce a subgraph of density m{H) containing these vertices and hence 
they will never be collapsed. This ensures that once all subgraphs have been 
collapsed, we will not be left with a single point, and that m{l-i) < m{H). 

The above also demonstrates that while the choice of dense subgraph 
to collapse will result in different Hi, ultimately, this process will always 
terminate with the same final multigraph, which we call T-L. To see why 
this follows, suppose a vertex lies in two different subgraphs, which we could 
choose to collapse. By ([1]) applied to the subgraph induced by the union 
of the two dense subgraphs, the new subgraph, formed by the collapsing 
process, will still have density m{H) and so the remaining vertices will be 
collapsed at a later stage to the same point. D 

It is clear that if we can embed the collapsed, dense subgraphs of H, 
required for a factor, and then embed the edges of "H we will have our 
required factor. Firstly, we prove that we can embed these dense subgraphs 
as required. Consider the graph H' with vertex set V{H) and edge set 
E{H) — EiTi), (i.e. H' contains only those edges collapsed by the above 
process). Let th.H'{n) be the threshold for embedding a factor of H' into 
G{n,p). 



10 



/r^^^iA : 



Figure 1: An example of an H and its respective l-L and H' graphs 

Lemma 3.2. For a non-vertex-balanced graph H , and the corresponding H' 
as defined above, the threshold for the existence of an H' -factor satisfies; 



th 



H'[n 



0( 



n 



^l/m(H)\ 



Proof. H' is a subgraph of iJ, and so m{H') < m{H), and since H' con- 
tains copies of the collapsed subgraphs of density m{H), it must itself satisfy 
m{H') = m{H). The edges of H' are exactly those that were collapsed in the 
process that generated l-L, and hence, any vertices of H that were not col- 
lapsed, will be isolated in H' . In other words, these vertices will have degree 
in H' . Such vertices must exist, since we assumed that m(f , H) < m{H) 
for some vertex v. Since we then have S{H') < m{H'), we can apply Theorem 
12.21 completing the proof as required. 

D 

We now have an if -factor, without the edges from each copy of H, that are 
also present in "H. To embed these final edges, we now use our generalisation 
of [22 



namely a specific application of Theorem 11.21 to find a factor of "H, 
with the collapsed subgraphs covered by the vertices we require from H. 

As in 0, we work in (a multigraph generalised form of) G{n,M), the 
graph chosen uniformly from all M-edge graphs on V (although we will use 
a multigraph generalised form of G{n,p) to prove our results) and derive a 
generalised form of 12.31 Since we will be operating with multigraphs and 
partitioned vertex sets, we need to define some notation. 

Let G be a graph on n vertices and let if be a fixed graph with vertex 



set {Xi,X2,X3, 






VH 



}. Let Ti be the multigraph obtained by repeated 



applications of vertex collapsing of subgraphs of H of density m{H), until no 
such subgraphs remain. Let k-u = |V^('H)| and h-u = |i?('H)|. If "H has vertex 
set {yi,y2, ■ ■ ■ ,yk-H\^ "^^ use lemma 13.21 to find partial factors consisting of 
n/vH copies of these collapsed subgraphs (or single vertices, for those that 
were not collapsed) for each vertex in "H. 

We use the standard method of partitioning the edges of G{n,p) into 
G{n,p') U G{n,p') where there is an edge in G{n,p) if and only if there is an 
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edge in at least one of the G{n,p'). Since we are only interested in threshold 
functions, which are equivalent up to constant factors, and 1 — p = (1 — p')^, 
which implies that p' > p/2, we can apply Lemma 13.21 without sacrificing 
randomness of the edges between these embedded subgraphs. This leaves us 
with k-u separate classes of the vertices of G{n,p), each containing graphs of 
density m{H) or isolated vertices, each corresponding to a vertex of "H. 

We now wish to find an "H-factor between these partitioned sets, but 
we are only interested in factors that connect the 'correct' vertices together 
from each partition, and hence are not interested in the edges within each 
partition, or those that are not the prescribed edges between the subgraphs 
we have already embedded. 

We can consider a random multigraph, which we call 7i{n,p), using the 
edges of our second G{n,p'), such that if the required ^/-factor exists here, it 
will translate into the required if-factor in G{n,p). Firstly, the vertex set of 
'H{n,p) consists of a single vertex for each of the isolated vertices and sub- 
graphs of density m{H), that we have embedded into G{n,p). We maintain 
the partition of these new vertices into equal sets of size n/vn, according to 
the vertex of "H, they correspond to in the initial embedding. Note that this 
means that ^{n^p) does not have n vertices, but rather kf-cn/vHi which is 
however, a constant multiple of n. 

For the edge set of T-L{n,p), we use the second edge partition G{n,p'), 
to ensure randomness. We consider a pair of vertices, Vi and V2 in ^{n^p), 
noting that we can also consider vi and V2 as sets of vertices of G{n,p), 
and the mapping a : v{H) — >■ G{n,p) that describes the already embedded 
subgraphs that form the vertices oil-Lin, p). For each xi and X2 G v{H), with 
(xi,X2) G e{H) and cr(xi) G fi and a{x2) G f2, if {a{xi),a{x2)) G e{G{n,p)), 
then we have an edge between vi and V2 in 'H{n,p), noting that we consider 
each such edge separately. In this way, any factor of "H found in l-Lin^p) will 
automatically translate into a factor of H in G{n,p). 

T-Lin^p) can also be thought of as a random A;^-partite multigraph with 
ky^n/vH vertices, and edges between vertices x and y with probability p for 
each edge between their origin vertices in H, and otherwise. Essentially, a 
series of h-^ = e{7i) bipartite graphs, in the same 'shape' as H. We will work 
using this random graph model (and the corresponding l-Lin^p) model), to 
prove our results. 

It may be helpful for some readers to visualise this as a fc-^-partite ran- 
dom graph with different edge probabilities for some of the edge partitions, 
rather than a multigraph. For example, a single edge with probability p^, 
rather than two edges, each with probability p between vertices. The varying 
probabilities make this model cumbersome to work with, however, and the 
multigraph notation is more convenient for use. 
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For some choices of H it may simply be possible to set all edge probabil- 
ities to the minimum of these values, and still find the factor, but our earlier 
graph, containing a K^, is an example of a graph for which this method would 
fail. 

At this point, we have the exact set-up for Theorem 1 1.2[ As we have shown 
that m{l-L) < m{H), it implies that for p = uj{n-^/"'^"^) > C'(n-i/™(W)+o(i)) 
(or, if "H is strictly balanced p > uj{n^^^'^^'^\\ognY^^-^^'^^^)) such an 7/-factor, 
a.a.s exists, and hence our H factor exists in our original G{n,p), as required. 



4 Theorem 11.2 



The proof of Theorem 11.21 largely follows the same steps as the original graph 
result in [6]. To illustrate the key ideas, we outline the initial setup and 
then draw out several of the key ideas of the proof. We begin the proof of 
Theorem 11.21 with a multigraph generalised version of their Theorem 3.1. 
This Theorem essentially shows that the number of factors in ^{n^p) is close 
to expectation, by demonstrating that the equivalent process of removing 
edges from the complete graph, does not remove too many factors at each 
step. 

Theorem 4.1. Let A;-^ = v{l-L) and h-^ = e{7i). For p = p{n) = u!(n~^^"^^^^) 
and M = M{n) = hy^in/vuYp, and let $(G) he the number of the H-Factors 
in a graph G, then 

Pr(<l>CH(n,p)) > (n^«-y«)'^/^He-^(")) > 1 -n^^W. 

Proof. The proof follows in the same steps as the original. Let T = hy^{n/vHy- 
M, and let 61,62, ... , 6h^(^n/vH)'^ be a random, uniform ordering of the com- 
plete form of our multigraph (i.e 'H{n,p) with p = 1), which we shall call 
KMn. Set Gi to be KMn — {ei, 62, ■ ■ ■ , Cj}, so if we let J-'{G) be the set of 
H factors on G, we define J-^ := J^{Gi). We then let ^i be the fraction of 
members of J-i containing e,. Then, as for the standard graph case, we have 

K0| Kt-l| 

and that therefore, we have 

t 

log I J-,1 = log I J-qI + Y, log(l - ^^)- (2) 

1=1 
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Here our sums start to differ somewhat from the standard graph case; we 
have 



k — 1 
log IJTJ = \og((nlvH)\f^~^ = — nlogn - 0(n). 

Vh 



Also, we have 



E 6 = , I I — ^2 ~r~^ =• ^' = E Ci ei> • • • ) ei-i 

hn[n vhY -i+l 



for any choice of Ci, . . . , ej_i. 
Therefore we have 






i=l 



^H h-uin/vHY -t 



provided that huiji/vHY — t > oj{n). We use the same property as for the 
standard graph case, namely At-, which is the event that 

log \Ft\ > log \J^o\-J2^i- ^(^) 

As before, we aim to show that with high probability At does not fail, i.e. 

fort<T,Pr(A) =r2~"(i). (3) 

This implies our theorem, since we then, setting t = T, have 

log$(?^(n,p)) = log \J^t\ > nlogn H logp — 0{n) 

Vh Vh 

(since M = h-}i{n/vH)'^ — T = hq-i^n/vHYv)- To prove ([3]), we use the same 
methods as [^, namely an Azuma's inequality, martingale argument. As 
before, we will define two auxiliary properties Ei and TZi for ^ < 1 < T — 1, 
that will allows us to establish control over the concentration of our variables. 
We set our martingale to have a difference sequence of 

„ _ J ^i — 7i if i3j and TZi hold for all j <i 
* [^ otherwise. 

And so our martingale is Xt = ^j=i Zi. We leave the formal definitions 
of Bi and TZi for Section 15. 3^ but in general terms, TZi states that each vertex 
is in close to expectation, number of copies of Ti, along with a second tech- 
nical condition, while Bi states that the maximum number of factors using 
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a particular copy of H, is close to the average over all copies of %. For all 
i < t, we will have that i3j_i and 7^j_i imply 

^, = o{\og-'n). (4) 

Our martingale analysis, will give us that Pr(|Xj| > n) < n~'^^^^ (i.e. \Xt\ < 
0{n)), and if we have Bi and TZi for i < t < T, we will then have that 
^t = Z]!=i ^i - li and therefore 

t t 

Using this, ([2]), (j4]) and the series expansion for log(l — x) we get that 

t t 

log \J^t\ > log I -Fol - 5^te + e') > log l^ol - 5^ 7. - 0{n). 

4=1 i=l 

As in the graph case, we are left with three possibilities for the failure of this 
to occur and hence, 

Pr(A) < 5^Pr(^,) + ^Pr(A,<,(i3,7^,) A A) + J]Pr(A7^,:B.)• 

i<t i<t i<t 

The previously mentioned martingale analysis shows that the second term is 
at most n"'^^^' , and we follow the same processes as [6] in Section [5] to show 
that for z < T 

Pr(^i) < n-'^(^) (5) 

and 

FriAiTZiB,) < n-^^'\ (6) 

These three bounds give us the required result. D 

In the next sections we outline the generalisation of the various results 
from the original factors paper, and include some notes on how we can apply 
them to our multigraph situation. Rather than just present a slightly modi- 
fied reproduction of ^6], and to make this generalisation of more value to the 
reader, we have first drawn out what Johansson, Kahn and Vu referred to as 
'the heart of the matter' and presented it as a stand-alone result, with our 
required generalisation and then continuing with the surrounding proofs. 

As demonstrated above, the factor result follows from showing that 
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and 

Pr(A7e,B,) < n-^^'l 

In proving the second equation, a second graph property C is introduced. 
The following shows that the failure of C results in two sets differing by a 
single vertex, such that the number of factors on the complement of these sets 
(subject to some restrictions) vary significantly. The proof of this revolves 
around the use of entropy results that we describe below, while in Section 
15.61 concentration results are used to demonstrate that the event is unlikely 
as required. 

4.1 Entropy 

We follow the results of chapter 6 of the factors paper but are left with a 
modification to make to their Lemma 6.1. As in the original, we have H{X) 
to be the base entropy of a discrete random variable X, i.e., 

H(X) = ^p(.)io,-L-^, 

where p{x) = Pr(x = X). Now, in our case, given a vertex y in a random 
multigraph G, we use X{y, G) to be the copy of "H in a randomly chosen 
?^-factor, and that h{y,G) = H{X{y,G)). We will require a slightly differ- 
ent result, than in the original, as we will only be interested in vertices y 
from a single partition set of our random multigraph. We suppose that our 
multigraph has the same structure as T-L{n,p), i.e. any copy of Ti will contain 
one vertex from each partition set of G. Given Vi, a partition set of G, we 
have the following 

Lemma 4.2. 

log$(G)<5^%,G). 

yeVi 

Proof. This result follows in the same way as in the original, using a variant 
statement of Shearer's Lemma. This states that, given a random vector 
Y = {Yi : i & I), and S, a collection of subsets of I, with repeats allowed, 
such that each element of I belongs to at least t members of S. For S & S 
let Ys be the random vector Yi : i E S. Then H(Y) < t~^ J2seS ^O^s)- If we 
let Y be the indicator for the random "H- Factor, then / is the set of copies of 
H in the complete form of our multigraph and S is the collection of sets S^, 
where 5*^ is the collection of copies of l-L containing a vertex x, taken only 
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over X G Vi. We have, therefore, that each copy of Ti, belongs to exactly one 
Sx in S, and so t = 1. It follows that; 

and since H{Ys) = h{y, G), the proof is complete. D 

The second entropy result of [6], namely Lemma 6.2, is not specific to 
graphs, and hence requires no generalisation for our uses. We state it below 
for reference. 

We let S* be a finite set, W : S ^>- TZ^, and let X be the random variable 
taking values in S with probability 

Pr(X = x) = W{x)/W{S), 

where, for a set A C 5*, ly(yl) is the sum of W over the members of A, i.e. 
W{A) = Z.^^Wix). 

Lemma 4.3. If H{X) > log l^l —(9(1), then there are a,h E rangeiW) with 

a<b< 0{a) 

such that for J = W~^[a, b] we have, 

\j\ = n{\s\) 

and 

W{J) > 0.7W{S). 

4.2 The heart of the matter 

We let ^{G) be the number of ?^-factors on a partitioned multigraph G, 
Vo be the set of vertex sets of size k-u in 'H(n,p) with a vertex from each 
partition set, and T-L{x,G) be the set of copies of "H in G containing the 
vertex x, again with each vertex from a separate partition set. We define 
D{x,G) = |7/(a;,G')| to be the number of copies of H in G containing a 
vertex x, while D{p) is the expectation of Di^x^l-iinjp)) in T-L{n,p) given a 
randomly chosen x. 

For Z, a disjoint union of elements of Vq, we define w : Vq — !■ TZ^ as 
w{Z) = ^{'H{n,p) \ Z), i.e. the number of partial 7{-factors in 'H{n,p) with 
only the vertices in Z, not covered. 

Lastly, fixing a set of vertices, Y of size k% — 1, taken from separate 
partition sets and two vertices x and y, both from the remaining partition 
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set in V{l-L{n,p) \ Y) we define w^^y '■ 'H{x,'H{n,p) — {Y U {y}}) -^ TZ^ as 
Wx,y{K) = w{K UYU {y})- In simple terms, w^^y can be thought of as the 
number of if-factors on Ti^n^p) \ {F U {y}} that use K as the copy of Ti 
containing x in the "H-factor. 

Definition 1. We say ^{n^p) satisfies A{p) if the following holds 

\og{(^ {l-i{n,p))) > nlogn H logp — Oin). 

Vh vh 

Definition 2. We say that 'H{n,p) satisfies TZtip) if the following condition 
holds. 

For each x E V , |Z)(a;,'H(n,p)) — D{p)\ = o{D{p)). 

Informally, A{p) says that the number of factors is close to expectation, 
while TZbip) says the same for the number of copies of "H that each vertex of 
%{n.iP) is in. 

For a A;^ — 1 subset of V{'H{n,p))., as always with each vertex taken from 
different partition sets; F, let Vq{Y) be the set of k-u subsets containing F, 
with the final vertex taken from the remaining partition set. 

Definition 3. We define C for'H{n,p) as follows: 'H{n.,p) satisfies C if for 
all k-n — 1 subsets of'H{n.,p), Y as above, we have the following: 

maxw(Vo(F)) < max{n"^('=«"i)<l>(?/(n,p)), 2medw(Vo(F))} 

We prove the following Theorem; 

Theorem 4.4. ATZhC implies that there exists a set of vertices Y , each 
taken from \Y\ = vh — I different partition sets of 'H{n.,p), and x,y in the 
remaining partition set, such that we can find a collection J of elements 
of H{y,n{n,p) - (F U {x})) and J' from H{x,n{n,p) - (F U {y})) with 
\J\ > n{\H{y,n{n,p) - (FU {x}))\), and w-l\J\ = nj-^lJ'] = [a,b] with 
a < b < 0{a) satisfying 

J2wyAX)>0.7w{YU{x}) 
xeJ 
and 

J2^-AX)<0.5w{YU{x}) 
XeJ' 

Proof. Suppose that A, TZb hold but that C fails. Therefore we can find at 
least one set F at which C fails. We therefore know that there exists x, such 
that w{Y U {x}) is maximum for choices of x and satisfies 

w{Y U {x}) > n-'^^^^-^^^{H{n,p)). 
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We now choose y with w(YU {y}) < med w{yo{Y)), and h{y,'H{n,p) — {YU 
{x})) maximal, given this constraint. 
Given A, we know that 

\og( ^('H(n,p))) > nlogn -\ \ogp — 0(n). 

While the failure of C tells us that 

w{Y U {x}) = ^{n{n,p) - (r U {x})) > n-'^^^^-^^^{H{n,p)), 
hence, combining the two we have, 

\og^{l-L{n,p) -{YU{x})) > ^~ n\ogn + ^^\ogp-0(n). (7) 

Vh Vh 

We use Lemma US] and apply it to the graph 'H{n,p) — {YU{x}). Letting 
Vi be the partition set containing x and y, we have, 

log$(H(n,p)-(ru{x})) < ^/i(z,H(n,p)-(ru{a;})). (8) 

zeVi 

However, we know that we chose y to have maximal entropy, chosen from 
a set of at least half of possible such z, and we also have that for any random 
variable X, the entropy H{X) < log(|range(X)|) (with equality only if the 
variable is uniformly distributed). 

The range of our random variable is contained in the set of copies of H 
containing the fixed vertex z in 'H{n,p) — (y U {x}), which by definition, is 
of size Di^z^l-iin^p) — (F U {x})) < D{z,'H{n,p)). We know from TZb that 
this is less than (1 + o{l))D{p). Hence, we have that at least half the z^s 
in §1 satisfy h{z, n{n, p) - {¥ U {x})) < h{y, Hin, p) - {Y U {x})) and the 
remaining, njlvH all satisfy h{z,T-L{n,p) — (Y U {x}) < log((l + o{l))D{p)). 
Therefore, ([8]) gives us, 

log($(H(n,p) -{YU {x}))) < J2 h{z,'H{n,p) - (F U {x})) 

z€Vi 
Ti 

< ^—ih{y, nin,p) -{YU {x}) + log(l + o{l))D{p))) 

ZVh 
Ti 

< ^^{hiy,nin,p) - (Y U {x}) + (kn - l)\ogn + hnlogp). 
ZVh 

Rearranging, to get h{y, 'H{n,p) — {YU{x})) on the left, and substituting 
from ([7j) we have, 

h{y,n{n,p) -{YU {x})) > {kn - 1) logn + hn logp - 0(1). 

19 



By 7^.5 we have that 

\og{D{y,U{n,p)-{YVJ{x]))) < \ogD{y,U{n,p)) <\og{{l + o{l))D{p)) 

= (A;^- l)logn + /i^logp + log(l + 0(1)), 

and hence combining with the above, we have 

h{y, H{n,p) - (r U {x})) > log{D{y, H{n,p) - {Y U {x})) - 0{l). (9) 

We now use our functions Wy^x and Wx,y, previously defined as; 

Wy^x{K) = w{K U F U {x}) and similarly Wx,y{K) = w{K U F U {y}). 

With Wy^x defined on H{y,'H{n,p) — {Y U {x})); the set of copies of Ti con- 
taining y in 'H{n,p) — (YU{x}), and similarly, w^^y defined on H{x, 'H{n,p) — 
(Y U {y})). Simply put, for a copy of Ti containing y in 'H{n,p) — (F U {x}), 
Wy^x is the number of "H- factors on this set, using that copy of "H. 

If we consider the random variable X{y,'H{n,p) — (F U {x})), which is 
the copy of % containing y m a. uniformly at random chosen "H-factor on 
1-L{n,p) — {Y\j{x}), we can see that the probability that Xixj^l-iiji^p) — (FU 
{x})) = H' for W E niy, H{n,p) - (F U {x}), is 

Z(^'H{y,'H{n,p)-(YU{x})) 

Also note that the denominator is equal to w{'H{n,p) — (F U {x})), since by 
summing only over copies of "H, we are counting each "H-factor exactly once. 

Similarly, X{x,'H{n,p) — (F U {y})) is determined by Wx,y, and the sum 
Y,z ^x,y{^) is equal to w{{Y U {y})). 

By the above, we have the setup used for LemmaH31 with S = Ti^y, l-iin, p)- 
(F U {x})). Noting that l^l = D{y,'H{n,p) - (F U {x})), IQ gives us the 
required condition, and we are able to apply the result to Wy^x- This implies 
that there exist a and b e range{wy^x), for which we can set J := w~l{[a, b]), 
and it will satisfy the following: 

\j\>ni\niy,nin,p)-iYu{x}m 

and 

Y,^yAZ)>0-7 Yl Wy^xiZ) = 0.7wiYU{x}). 

z€J zeH{y,G-{YU{x}) 

In simple terms, J is of the same magnitude in size as the whole pre-image 
of Wy^x, and its elements have overall weight at least a constant multiple of 
that of the whole set. 
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Equally we can set J' = w^^{[a, b]), and we know that 

J2^-AZ) < w{Y U {y}) < 0.5w{Y U {x}) 
J' 

The first inequality follows from simply summing over the full set containing 
J', and the 2nd from our original definition of y and x. This completes the 
proof. D 

Proving that this is a.a.s. unlikely to happen, requires a range of con- 
centration and technical lemmas, demonstrated in the following sections. In 
applying this result to Shamir's problem, if instead of considering factors, a 
matching of hyperedges is required, it has been shown that the proof follows 
with much more ease using a union bound argument, reducing the technical 
complexity of the proof considerably [4] . 

5 Generalisation of remaining results from 



The generalisation to partitioned structures and multigraphs of the remaining 
results and properties of [6] , follow largely from careful consideration of sums 
and bounds, and formulation of polynomials. The following sections follow 
the structure of [B] closely, and are largely a technical exercise, that offer 
little to those who have read the original paper. 

To highlight why the generalisation should follow, we note that while 
limiting the factors to these partitions appears to drastically limit the number 
of possible copies of H, since each partition is of linear in n size, we still have 
0{n^'^) = 0(^^), possible choices of vertices for each H, as in the standard 
case. 

We also address the threshold required for applying Theorem 11.21 in ob- 
taining Theorem 11.11 We are not guaranteed strict balance for the resulting 
Ti, but regardless, the collapsing process, ehminates all subgraphs of density 
m{H), and hence, m{l-L) < m(H) and so for p > n~^^"^^^\ we have a greater 
probability than required within the proof and so with Theorem 14. 1^ applied 
to 'H(n,p), on the partial factors already embedded during the collapsing 
process, we have Theorem 11.11 as required. 

Throughout the proofs, for clarity in understanding our main result, we 
treat "H, as the graph formed by the collapsing process on some H, and that 
we have p > ?2~i/'^(^) as in Theorem 11.11 but for proving Theorem 11.21 in 
full generality, Ti may not necessarily be derived from some H, and we only 
have p > 77,-i/'"(^)+o(i) (^or with a log term for the strictly balanced case). 
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In this case the proof is unchanged, as throughout, as in [6j, we only require 
that a p = (9(p^^/"), T-L contains no subgraphs of density equal to 1/a, and 
that n^'n~^p^'n = ci;(logn), which follows immediately from the conditions in 
Theorem II ■2[ given that the o(l) term decreases sufficiently slowly. 

5.1 Concentration Results 

Firstly we address the usage of the various concentration results in Section 
5 of [6j . These results are largely special cases of results by V. Vu that can 
be found in [7] (with J.H.Kim), [H] and [I5]. 

We will utilise the various polynomial results here without modification, 
and hence will not repeat the proofs here again. 

We will require some of the notation used in this section later, which 
we outline now. We let / = f(ti,t2, . . . ,tn), be a polynomial of degree d 
with real coefficients. We say / is normal if its coefficients are positive, with 
the maximum coefficient being 1, and we note that the results here are also 
true for 0{1) normal polynomials, which simply means that the polynomial's 
coefficients have some fixed bound. 

We will consider polynomials that are multilinear which means we can 
express / in the form f{t) = Y^ autu, where U ranges over subsets of [n] and 

Lastly, we need that for a set L C [n], the partial derivative of order \L\ 
with respect to the variables indexed by L is J2uDL^utu\L, and its expecta- 
tion, denoted E^ or E^/ is ^{o^u Yliemi Pi '■ ^^ -^1' where ti ~ Ber{pi). 
Set Ej/ = max|i|=jE2,/. We write E^^ = "Kj^f for the expectation of the 
non-constant part of the partial derivative of /, with respect to L, noting 
that for homogeneous, / of degree d, and < |L| < rf, we have "K^^f = E/,/. 

We take the original example used to illustrate the usage of these results, 
namely that we consider our polynomial / to be the number of copies of Ti 
in our random multigraph 7i{n,p) containing a particular, fixed vertex Xq. 

We have that / = ^^ tjj where U runs over edge sets of copies of Ti in our 
complete multigraph, containing our vertex xq. We have E/ = Q{n^'^~^p^'^), 
while for any non empty subset L of edges from the complete graph, the 
partial derivative will be E^/ = Y1iUZ)l^u\l- This will be for L that do 
not satisfy our multigraph structure requirements (i.e. at most one edge 
from each bipartite pairing forming the multigraph), in the same way that 
choosing an L not forming a subgraph of H would do, in the standard graph 
case. 

In all theorems in the chapter we are interested in ensuring that the 
maximum value of the derivative does not exceed a certain magnitude, and 
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so in this sense, we are not interested in these cases, and so they cause no 
issue in this generahsation. 

Given that our choice of L does satisfy our structure requirements, (and 
hence will be contained within at least one copy of "H in the complete graph), 
we can consider the graph formed by the edges of L, and the vertex end-points 
of these edges. Letting k'y^ and h'y^ be the number of vertices and edges 
respectively of L, then if L contains Xq, we have E^/ = C)(n'^^^^«p^«^^?^), 
and Oiji^'^^^'n^^p^'^^^'H) otherwise. Either way, we have, 

E//Ei/ = (^(n'^^-y^). 

While we do not have strict balance of "H, we do have that it contains no 
subgraphs of density m{H), and hence we have that h'-^/lk'-^ — l) < m{H) and 
recalling that p = Lu{n-^/'^'^^'>), we have that E/ = fi(l) and that Ef/ELf > 
n^^^\ as is required for applying the results in the chapter. 
For reference we include the results from the chapter below. 

Theorem 5.1. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, homogeneous, normal polynomial of 
degree d such that E/ > n'^maxi<j<(iEj/. Then 

Pr(|/-E/|>eE/)=n"-«. 

Theorem 5.2. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal, homogeneous polynomial of 
degree d such that E/ = co'(logn) and maxi<j<(i_i E^/ < n*^. Then 

Pr(|/-E/|>eE/)=n-«. 

Theorem 5.3. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal, homogeneous polynomial of 
degree d such that E/ = oj{\ogn) and maxi<j<(i_i Kjf < rfW^f . Then 



Pr(|/-E/|>eE/)=n 



Ml) 



Corollary 5.4. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal, homogeneous polynomial of 
degree d such that E/ < A where A = A{n) satisfies 

A > w(logn) + n^ max E,/, 

0<j<d 

then 

Pr(/> (l + e)A) <n-'^W. 
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Theorem 5.5. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal polynomial of degree d with 
Kf = a;(logn) and maxi^oE^/ < n^Kf. Then 

Pr(|/-E/|>eE/)<n-«. 

Corollary 5.6. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal polynomial of degree d such 
that E/ < A where A = A{n) satisfies 

A > u){logn) +72*^ max E^/, 

then 

Pr(/> (l + e)A) <n-"^W. 

Theorem 5.7. The following holds for any fixed positive integer d and pos- 
itive constant e. Let f he a multilinear, normal polynomial of degree d with 
max^, E^/ < n'^. Then for any (3{n) = uj{1). 



Pr(/>/3(n)) =n-'^W. 



5.2 Martingale 



The proof of the bound on the martingale follows exactly as that for the 
original paper. We have all the same bounds, namely that \Zi\ < e := log~^ n, 
and that X]j=i7« ~ 0{n\ogn). and the proof makes no use of the graph 
setting for the problem. We include the steps below for reference. 
We let Xt = Zi -\- ■■■ -\- Zt, and aim to show that 



Pr(Xi >n)<n 



M^) 



We have that Z^ is a function of the random sequence ei, . . . , Cj, but that 
E(Zj|ei, . . . , ej_i) = for any choice of the e/s. Using (jlj), it will follow from 
the properties 7?. and B, that \Zi\ < e := log~^n. We can apply Markov's 
inequality to derive the following, for any positive h; 

Pr(Xi >n) = Pr(e'*(^i+-+^') > e'*") < E(e''(^i+-+^*))e-'^". (10) 

Using Zi = ^i- 7i, we have that E(^i|ei, . . . , Ci-i) = 7^. Using < Ci < £ 
and the convexity of e^, we have 

E{e''^\eu ..., e._i) < e''^^ ((^ - 7) + 7^'^ 
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Taylor series expansions, show that the right hand side is at most e^ '^'^', for 
any < h < 1. Using induction on t we derive the following. 

E^gM^i+-+2t)) = E(E(e^(^i+-+^')|ei, . . . , et_i)) 

= E(e'^(^i+-+^*-i)E(e'^^* |ei, . . . , et_i)) 
< E(e'*'^^^+'"+^*"^)e^^^'^') 

Combined with fllOl) . we have 

We have that ^j^^ 7j = 0{nlogn) and e = log""*^ n, and so setting h 
to be a sufficiently small positive constant, leaves the right hand side as 

g-o{n) _ ^-1^(1)^ g^g required. 

5.3 The Properties B and TZ 

We now define our slightly altered properties Bi and TZi. We note, as before 

that in proving (|5]) and ([6]) we can operate in the random graph 7i{n,pi) 

rather than Gi, where 

i 

huin/vHY' 

We will define graph properties B and TZ{p) and then the event Bi will be 
{Gi satisfies B} and TZi will be {Gi satisfies TZ{j)i)}. 

In defining B, we use the same notation for the functions W , namely that 
for a finite set A and PF : A — )■ [0, cxd), set 

maxP^(yl) = maxVr(a), 

and lastly that, 

maxr W{A) = W(A)-^max Vr(A), 

with med iy(y4) the median of W on A. 

For a multigraph G with our required partition structure, and vertex set 
V , let Z he a choice of k-^ vertices from V, with each element taken from a 
different vertex partition. We then let wg{Z) = ^{G — Z). Therefore, wg{Z) 
is the number of "H-factors in the multigraph induced by G on the vertex set 
V\Z. 
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It could also be thought of as the number of ^/-factors in G, containing 
Z as a copy of "H, if all edges between the vertices of Z had been added in, 
where they are not already present. 

We also use wcij^^ = wg{V{K)) for K G 'H(G), the set of copies of "H in 
G (which will contain one vertex from each partition of G). 

We now define property B, for a multigraph G as for the graph case, 
namely 

B{G) = {maxr wg(7/(G))) = 0(1)}. (11) 

As in the graph case, B{G) states that no copy of "H in G is contained in 
much more than the average number of H- factors, for a copy of H. 

We define 7?.(p), for the most part, in the same manner as for a graph, 
with two parts to the definition. For the first part, we use almost the same 
set-up. We have G, our random multigraph and V its vertex set, and given 
A C Vin), E' C E{'H)\E{'H[A]), an injection ^p, from A to V (mapping 
vertices to the correct partition of V corresponding to their location in "H). 
We let X{G) be the number of injections : ¥{?{) — )• V with 

= ^ on A (12) 

and 

xy e E' ^ (j){x)(l){y) e E{G). 

We can write X{'H{n,p)) in an obvious way, as a polynomial in variables 
te = l{eeE(Hin,p))}, for 6, an edge in the complete form of our multigraph: 



X{nin,p)) = q{t) = J2UiE' 



where t is the indicator function for edges of the multigraph, and the sum is 
over all injections (p satisfying flT2|) . 

As in the original paper, we have that this function q{t), is multilinear, 
(9(l)-normal and homogeneous of degree d = \E'\. We use the same definition 

for E*; 

E* = maxjE^g : \L\ < d}. (13) 

We also use the same definition for D{p), using it as the expected number 
of copies of "H in l-l{n,p) using a given vertex x G V, while D{x,G) is the 
actual number of copies containing x in G. It is clear that 

D{p) = {n/vuf'^'^P^'^ = ein''^~^p^^). 

We are now ready to define 7^(p), which is identical to the graph formulation, 
not taking into account our slight changes to the above notation. 
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Definition 4. We say that a random multigraph G satisfies TZ{p) if the 
following two conditions hold. 

(a) For A, E' and tp (and associated notation) as above: z/E* = n''^^^\ 
then for any P{n) = u{l), X{G) < P{n) for large enough n; if E* > 
n~°^^\ then for any fixed e > and large enough n, X{G) < n'^K*. 

(b) For each xeV, \D{x, G) - D{p)\ = o{D{p)) 

We can now prove that these conditions give the required bound on the 
size of ^i. The proof follows in the same fashion as in the original paper: 

Lemma 5.8. Fori <T = hy^ln/vn)'^ — M as defined in Section\^ Bi-i and 
IZi-i (i.e. that Gj_i satisfies B and7l{pi)) imply Q) 

Proof. Write w for wg,_i- We aim to show that Bi-i and TZi-i imply that, 
for any K G 7i{Gi-i), 

w{K)mGi^i) = 0{1/D{pi_,)). 

As before, the left hand side of this equation is the fraction of "H-factors in 
(jj_i that use K, and we prove this result in the same fashion, since we have; 

<l>(G,_i) = --IL^,(^niG,.^)) 

= ^fi(|?/(G,_i)|maxu<-H(G,;_i)) 
= n{Dip,^^)w{K)). 

The first line follows, since each "H-factor will be counted n/vn times by 
summing the w function over all copies of "H. The second line follows from 
applying B, while the third comes from part (b) of TZi-i, and noting that 
iknn/vH)D{p) = kn\niGi_,)\. 

We also have, using the same arguments as in the original, that part (a) 
of TZi^i implies that the number oi K E 'H(G'j_i), containing a given edge 
e e E{Gi-i) is at most /3(n), satisfying /3~^D(pj_i) = ci;(logra). Here we also 
use that D{pi_^i) = n^'^~^p^]^-^ = ci;(logra), for i <T. D 

Lastly we state the required 'p- version' of At'. 

A{p) = I log I J-(G)| > log I J-ol - ^^7. - Oin) 

where t = [(1 — p)/i^(n/t'//)^'|. Recall from the end of Theorem 14. ![ that our 
required result will follow from the following lemmas 
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Lemma 5.9. For p > uj{n i/^-C^))^ 

Pr(H(n,p) satisfies n{p)) = 1 - n^'^'^^K 
Lemma 5.10. For p > cu^n^^^"^^^^), 

Fi{n{n,p) satisfies Aip)n{p)B) = n~'^^'^\ 
We prove these lemmas in the next subsections. 

5.4 Regularity 

We now prove Lemma 15^ i.e. that with probabihty 1 — n^'^^^\ G = 'H{n,p) 
satisfies both parts of the definition of TZ{p). 

Part (a) follows easily in the same fashion as the original paper. There 
are only rP'^^' choice for each of A, E' and if)^ so we simply show that the 
probability of one of these violating (a) is n"'^'^^^ . As we noted earlier, we can 
express the random variable as a polynomial, and since it is homogeneous, 
multilinear, and (9(l)-normal, we can apply the probability results from the 
concentration chapter of [6J directly, which gives us the result as required. 

Proving (b) requires only slightly more adaptation to our scenario. We 
again express D{x,G) as a polynomial of degree h-^ = |£'('H)|, in the vari- 
ables te = leeE(G); whcre e belongs to the complete form of our random 
multigraph. Therefore we have, 

D{x,G) = fit) := J2{tK ■■ K e -Hoix)}, 

where 'Ho(x) is the set of copies of of Ti, containing x, in the complete 
multigraph, and tx = YleeK^e- 
As noted earlier, we have, 

E/ = {n/vH)^'^'^p'''' = e(n'=«"y^) = w(logn). 

We aim to use one of the concentration results from [0], namely Theorem 15.31 
which requires the above and that maxi<j<(i_i Kjf < n~'^E,f. For L a subset 
of the edges of the complete multigraph, with 1 < \L\ = I < h-u, we have 

where N{L) is the number of K G "Hola;) with L C E{K). Let / = V{L)U{x}, 
V{L) being the set of vertices incident to the edges of L, and k!^ = \I\. Then 
N{L) = Q{n^'^~^'n) if the graph "H' := {I-,L) is isomorphic to a subgraph of 
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H, and zero otherwise. Therefore we have, recalhng that p > n i/™(^) and 
diH) = eH/ivH-l), 

= l](n[iMW')-i/rn(^)]/) =^f^(i) 

Using the fact that "H contains no subgraphs of density m{H) (or denser). 
We therefore have the required conditions to use the concentration theorem 
and the result follows, which provides us with part (b) of 7^ as required. 

5.5 Proof of Lemma 15.101 

We now begin the proof of Lemma I5.10[ continuing in the same vein as the 
original paper, as there, we will prove that B is satisfied, using an auxiliary 
event, C. Most of these results follow in an identical manner to the original, 
but with small conditions on the choice of sets, and differing constant powers 
in the equations (largely from use of ky, rather than vh)- We include these 
modified results for completeness. 

We write Vq to be the collection of k-^-sets of V^ = V{'H{n,p)), with a 
single vertex from each partition oi'H{n,p). For a set F C V, with |y| < k-^ 
and at most one vertex from each partition, we write Vo{Y) for the set 
{Z E Vq : Z ^ Y}. We then extend our earlier weight function w = w-}i{n,p) 
to these sets Y, by setting 

wiY) = J2MZ) : Z e VoiY)}. 

We define our new property C for l-iiji.p) as follows: 'H{n,p) satisfies C if for 
all such Y, as defined above, with \Y\ = k-^ — 1, 

maxw{Vo{Y)) < ma.x{n~^^^^-^^^{n{n,p)),2med w{Vo{Y)} 

We will then prove Lemma I5.10[ by proving the two following results. 

Lemma 5.11. Pr(^7^C) = n-'^^^l 

Lemma 5.12. Ft{TZCB) = n-'^^^l 

We have already demonstrated the first part of the proof of Lemma 15.111 
in Section 14. 2[ For clarity, to follow the arguments of the original paper, 
we will first prove Lemma I5.12[ before returning to Lemma 15.111 in the next 
Section. We must firstly show that 

\{K e Vo : w{K) > 6maxw{Vo)}\ = fi(|Vo|) (14) 

29 



Noting that |Vo| = r2(n^^), and that this imphes niaxrw(Vo) = 0{1). We 
now need another small modification of a lemma from the graph case. We 
let ipiX) = maxw(Vo(X) and let -B be a positive number, recall that V is 
the vertex set of ^{n^p), and is of size k-nn/vn, with k-^ partitions of size 
n/vH- 

Lemma 5.13. Suppose that for each Y <Z V , satisfying \Y\ = k-^ — 1 and 
with at most one vertex from each partition ofV, and ip{Y) > B we have 



z e VoiY) : w{z) > i^(r) 



^ n/vH - k-H 



Then for any X O V with \X\ = k-^ — i, at most one vertex from each 
partition and ipiX) > 2^^^B, we have 



Z £ V„(A') : tti(A') > i«.(.Y) 



> r-^^^^] ,^^. (15) 



Proof. We write Ni for the right hand side of the above equation, and proceed 
by induction on i. The case z = 1 is the hypothesis of the lemma. We then 
assume X as stated, and choose Z G Vo{X) with w{Z) = ipi^X) (i.e. Z such 
that w is maximal). We let y G Z\X and Y = X U {y}. We then have 
that |r| = k-H- {i -I) and that ipiY) = ipiX) > 2'-^B{> 2'-^B), and 
so, by the inductive hypothesis there are at least A^j_i sets Z' G Vo{Y) with 
w{Z') > 2-(^-i)^(r). For each such Z', Z'\{y} is a {kn - l)-subset of V 
with tlj{Z'\{y}) > w{Z') > B. So then for each such Z', there are at least 
{n/vH - kn)/2 sets Z" G VoiZ'\{y}) with 

w{Z") > HZ'\{y})/2 > 2-V(X). 

Therefore the number of such pairs, {Z,Z") is at least Ni^i{n/vH — ky)/2. 
Equally, for each Z", each corresponding Z' is Z"\{u} U {y} for some u G 
Z"\[X U {y})- Therefore the number of such Z' is at most i — 1, providing 
our factorial term. This completes the proof. 

D 

We now continue the proof of Lemma [5.121 We set 6 = 2^'^ and then C im- 
plies the hypothesis of the above lemma, with B = (2n)~'^^^^^)$(7{(n,j9))(> 
n~^(^«"^)$('H(n,p))). We also clearly have that 

V^(0) > n"('^«-^)<l>('H(n,p)) = 2^«-^5. 

We also set 7 = {2'^'^^^ {k-^ — 1)!)""^, and using flT^ . we now have 

\{K G Vo : w{K) > 6maxw{Vo)}\ > -fn^^. 
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We let J be the largest power of 2, not exceeding maxw and 

Z = {Z eVq: w{Z) > 6 J}. 

In a sense, Z can be thought of as the vertex sets of size ky^, whose com- 
plement has a relatively large number of factors. For any set X C \/ with 
|X| < k-y, with at most a single vertex from each partition, let Z{X) = {Z e 
Z : X C Z}, and say such a set X is good if |2(X)| > 7?t,'^^~I"^I. In particular 
we know that the empty set is good. We then fix an ordering oi, . . . , a^^ of 
V^H). For distinct vertices, xi, . . . ,Xr G 'H{n,p), we define S{xi, . . . , Xr) to 
be the collection of copies of "H in the complete multigraph KMn, for which 

(j){ai) = Xi for i e [r], 

(f){ai)(j){aj) G E{'H{n,p)) whenever i,j >r and aiGj G -E'('H) 

and that 0(V"('H)) G Z. 

For each r G {0, . . . , A;-^} let A/'r = N{ar) fl {a^+i, . . . , Ofc^}, and dr = 
\Nr\, where N{ar), means the neighbourhood of a^ (in "H). We now let 
y{xi, . . . , Xr) be the event 

{|5(xi, . . . ,a;,)| = fi(/'-+-+^*«-in^«-")}. 

Note that in particular we have, d^^ = 0, and, recalling that T-LiG), is the 
set of copies of "H in l-iin^p), 

5(0) = {0 G ?/(G') : w{<j){V{n))) > 6J} 

and that 3^(0) is the event 

Then, for Vi, . . . ,Xr G V, and from distinct partitions, let Q{xi, . . . ,Xr) be 
the event 

{{Xi, ...,Xr} is good} A y{Xi, ...,Xr)}. 

Since we have shown that C implies that the empty set is good, we have that 
BC C (5(0) and therefore, we simply require to show that 



Pr(7^Q(0)) = n 



Ml) 



to prove Lemma 15.121 we continue (as always, in the same fashion as for 
the graph case), by proving a slightly more general argument for induction 
purposes, namely, that for any choice of r and vertices xi, . . . ,Xr, 

Pr(7eQ(xi,...,x,)) = n-"«. (16) 
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Our induction is on k-^ — r, with our initial step r = fc^, trivially following 
since the definition of being good for subsets of size k-^ is to belong to Z. 
For general r < A;-^, we set X = {xi, . . . ,Xr}- and then let V be the event 

{y G V\X, X U {y} good ^ 3^(xi, . . . , x,, y)}. 

By the inductive hypothesis we know that Pt(JZV) = n'^^^^\ so we only need 
to show 

Pr(7^PQ(xl,...,x,)) <n-'^W. 

We also note that if X is good then, 

\y:XU{y}good\=n{n). (17) 

To ensure that the edges between Xr and V\X are independent of the initial 
conditioning, we use a relaxed form of 7^, TZx, which we say is satisfied if it 
satisfies part (a) of TZ, whenever A = {ai, . . . , a^}, ip^ai) = Xi {i E [r]) and 
E' C {H -A). 

As for the graph case, if TZV A {X good} holds, but y{xi, . . . ,Xr) does 
not, then there must be some J = 2^'^, with k-^ and integer not exceeding 
n log n (the magnitude of the log of the number of "H-Factors in the complete 
multigraph), such that with Z good, we have the following, (noting that 
throughout this chapter, wherever we choose sets of vertices from V — X, we 
choose them from partitions that do not contain vertices of X), 

(a) TZx holds; 

(b) There are at least Q{n) y's in V\X for which we have y{xi, . . . ,Xr,y) 
(by (HZ])), and lastly, 

(c) y{xi, . . . , Xr) does not hold. 

We note, that for a given J, the first two properties, depend only on 
G' := G — X. Since the number of possibilities for J is at most nlogra, it is 
enough to show that for any J and G' satisfying (a) and (b) (with respect to 

J), 

PT{y{x,,...,Xr)\G')=n-^'^'l 

Given this fixed G', we can express |S'(a;i, . . . , Xr) \ as a multilinear polynomial 
in terms of the indicator variable for edges between Xr and other vertices of 
H{n,p), i.e. 

tu '■= '^{xrU€E('H{n,p)} U e V\X. 

giving the polynomial. 



\S{xi,...,Xr)\ = git) := ^aut 



u 
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where U ranges over d^ subsets of V\X, with the vertices taken from the 
correct partitions, corresponding to the edges from a^ = (f)''^{xi) in 7i, and 
au is the number of copies ip oi K := Ti — {ai, . . . , a^} in G' with the induced 
subgraph, 

i^{Nr) = U 

and 

ipdar+i, ...,ak^})UXeZ. 



We now apply a concentration result from Section I5.H namely Theorem I5.3[ 
To apply this, we require a normal polynomial, so we normalise, and consider 

fit) = a-'git), 

where a is the maximum of the a^y's. The hypothesis requires that E/ = 
c<;(logn) and maxi<j<rf_i E^/ < n~^E/, and will allow us to say that it is 
close to expectation. We rewrite: 

g{t) = Y^ ^{t0(7v,) : 0e ^(xi,...,x^,?/)}. 

y&V\X 

Since we know that the indicator variables, t^, u & ^\-^, are independent 
of G', which determines our sets S{xi, . . . , Xr, y), and using property (b) our 
situation, we have 



yev\x 



■+dfc„-i^fc? 



Noting that if dr = 0, then there are no random edges to consider, and we 
have that \S{xi, . . . , Xr)\ will equal 

J2 \S{xr,...,Xr,y)\ = n{p''^+-+'''^^-^n'^-n, 
yev\x 

as required, we will now assume that d^ > 0. 

We now set Ti' = Ti — {ai, . . . , a.r_i}, and so dj. + ■ ■ ■ + d^-^^i = e{H') 
and that k-^ — r = v{'H') — 1. This gives us that the right hand side of the 
expectation of g^ is Vt^p^^^ )n'"^'^ )~^). Using that p > n~^^'^^^^ and that "H 
contains no subgraphs of density m{H), we have 

Fn- / ^(log^) if '^ = 1 
^^"\n^« ifr>l. 

We will now show that for the normalised polynomial f{t), that 

E/ = u;(logn) (19) 
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and 

max{Er/ : T C V\X, < |T| < rfj = n^^^^^Ef. (20) 

With these conditions, the concentration theorem will tell us that /, and 
therefore g is close to its expectation, which implies 3^, as required. To prove 
the two conditions, we will find it easier to consider the partial derivatives 
of g rather than /. We use t^ = leeE{Hin,p)), ts = Hees^e and t = {t^ : e e 
E{KMv\x)), where KMv\x is the multigraph induced by the 'complete' 
multigraph KM^ on the the vertex set V\X. 

Since we are only interested in establishing upper bounds on the partial 
derivatives oi g, we may now disregard the second requirement on au-, namely 
■j/'da^+i, . . . , ak-u}) U X G ^. Therefore we are left with 



p-{.dr-i)^^g < r(t) :=^t 



<I>{E{K)), 



where we sum over 0, injections such that 

: V{K) -^ V\X with (j){Nr) 3 T. 

We set E\ as before to be E* = maxjEir : L C E{KMv\x), \L\ < 
\E{K)\}. We will show that there is a positive constant e (depending only 
on Ti), such that (for large enough n), 

pdr-i^* < n-'Eg. (21) 

This will give us the two requirements for our concentration theorem, as 
follows. To prove (HM . we need to show that 

a~^Eg = a;(logn). 

We apply (1^ with T = U, a rf^-subset of V/X. We consider the two possible 
conditions of TZx, which gives two separate cases, firstly if Eg > n^/^, (i.e. 
E* > n-°(ih then TZx tells us that 



au 



Eug < r2"/^max{l,E*} < n^'^^Ef. 



In the other case we are left with E* < n"^/^, and hence E* = n~^^^\ We 
know that Eg = uj{logn), and hence we choose our /3(n) = uj{l) (from TZ) 
such that f3{n)^^Eg = ci;(logn). Since this does not depend on our choice of 
U, we have flT^ as required. 

To prove the other requirement, we need Enpg = n~^^^^Eg for any T, as 
we defined in (120|) . We again apply (12T|) . noting that we can decrease the 
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e without violating the equation, and use this observation to assume that 
e < l/m{H). This gives us 

Erg < /'■"'n"/^max{l,E*} < n-'^^Eg 

as we required. 

We now return to prove (HTh . We fix L C E{KMv\x) and let hi = 
\E{K)\ - \L\, (recalling that K = H - {oi, . . . , aj). We know that E^r = 
P^'-Nl, where again Nl is the number of (p, as defined just before fl2T|) . with 
(j){E{K)) D L. Each satisfies (l)iV{K)) D I := T U V{L), where as earlier, 
V{L) 3 V\X is the set of vertices incident to the edges of L. We let / = 
{«!,..., is}, then we have N^ = ^ Ni^{bi, . . . ,bs), where (61 ... , bg) range 
over s-tuples of distinct elements of V{K) and we sum over the number of 
0's as above, with 0(&j) = ij for each 1 < j < s. We only have 0{1) choices 
for the 6j's and hence the result will follow if we can show that for any such 
choice, 

/'-'+''' A^i (61, ...,bs)= n-^^^^Eg. 

Given a fixed choice of 6i's, let T-L" = T-L[{ar,bi, . . . ,bs}]. We know that 
A^l(6i, . . . , bs) < n^^-"-' = ri^^^-'-C^CH")-!)^ ^^^ ^j^^^^ h > I + dr+i + ■■■ + 
dk^-i — e{T-L"), since \E{K)\ = rfr+iH — ■ + dk^_i and E{l-i") contains (p^^i^L) 
and at least I edges joining a^ to V{K). Therefore we have 

/'-'-'^'A^i(6l, . . . , 6,) < pdr+-+du^-^^kn-r^^v{'H")-l^e{H")Y\ 

Noting that since p > n~^^'^^^^ and that "H contains no subgraphs of density 
m{H), we have that the expression in the square brackets is n^^^\ Combined 
with our earlier bound on Eg, from (1181) . we have the bound required above 
and hence (1211) . completing the Lemma. 

5.6 Proof of Lemma 15.111 

We now begin the final step of the proof, namely that Fi{ATZC) = n~^^^\ We 
maintain our use of notation from the previous chapter with G = 'H{n,p), and 
TZ and A. In Section 14. 2^ we have shown that ATZC results in the following; 
There exist x and y; vertices in the same partition set of G = 'H{n,p) 
and a set Y with a single vertex from each of the remaining partitions, such 
that there exist, a,b & range{wy){= range{w')), such that J := w~^{\a,b]) 
satisfies 

\J\>n{\n{y,G-R)), 

and 

Wy{J) > O.rwyiniy, G-R)) = 0.7w{R). 
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With, J' = w^^{[a,b]), we also have, 

Wy{J) > 0.7w{R), 

and 

w^if) < w{S) < 0.5w{R). 

All that remains is to show that the probability of this event is n~'^^^'> . 

Once again, we can express Wy{J) and Wx{J') as evaluations of a multi- 
linear polynomial in variables {tu '■ u G W}, once we have conditioned on the 
value of G[Vr], in the following way. Given a set t/ C PI/ with \U\ < k-^ — I 
and at most one vertex from each partition, let G^ be the graph obtained 
from G[iy] by adjoining a vertex w* say, with neighbourhood U. We let JCu 
be the set of copies of "H in G^, containing {w* : u E U}, and 

«c/ = J2^w\ViK)\{w*}) : K E ]Cu,w'iViK)\{w*}) E [a,b]}. 
Our polynomial now becomes, 

9{t) = ^ ctutu, 
ucw 

and Wy{J) and Wx{J') are simply g evaluated at the point t' := '^{z<^w:yzeE{G)} 
and t := l{^eVK:a;2e-B(G)}- 

TZ tells us that Dciy) = Q{n^'^~^p^'^) and therefore, considering that at 
most o{n^'^~^p^'^) copies of H lie in G and contain y and meet i?, we have 
\H{y,G-R)\ = e(n'=«-V) =w(logn). 

We know from our conditioning of J, that it satisfies 

and 

Wy{J) = 0(6n'=«-y'^). (22) 

We now apply a concentration result, namely Corollary 15. 6[ To apply it, we 
require a bound on the expectation of /, namely, that if Ef < A, where 

A > wflogn) + n'^ maxE'r f. 

then the corollary gives us Pr(/ > (1 + e)^) = n^'^^^\ We fix T C W, with 
\T\ = I < ky^, and as always, each vertex from a different partition set. 
For d = /,..., A;-^ — 1,, and t as t = (tg : e G E{KM]v)), we consider the 
polynomial, 
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Where z ranges over vertices of 1-i of degree d and over the injections 
V{E)\{z) -^ W with 0(A^^) D T. Then we know that 

ax < b hiit). 

Since /i/(t) is the number of such sets, and we know that their values are 
bounded by 6, this also gives us 



E^^<&$^/-'/^d(t), 



where WJj- is the non-constant part of the partial derivative. 
We let 

E* = m&x{ELhd : L C E{KMw), \L\ < h^ - d}. 

As in the previous chapter, similarly to (12T|) we can assert that there is 
positive constant e dependent only on H, such that, for each d, 

This follows from the proof of (|2T|) in the previous chapter, by noting that 
in our definition of hd, there are only finitely many z, and the inner sum, is 
bounded by the polynomial r, used in (pTj) . with r = 1, ai = z and hence 
dr = d. 

We are now able to apply the concentration results we require. We con- 
sider the polynomial / = a^^g, where a = maxf/ au- Then we have, using 

f{t') = a-^Wy{J) = Q{a-^hn^^-^p^'^) 

and using the same arguments as we used at the end of the previous chapter, 
we can show 

fit') = w(logn) 

and 

max{E^/ : T C Vr,T ^ 0} = n-^^^^f{t'). 

For the final step, we first summarise that we have shown ATIC implies that 
there exist y, x, y and a, b & range{w') for which we have the above two 
conditions, and 

fit") < O.Sfit'). 

However, we know that for any given choice of y, x, y, a, and b, f depends 
only on (^[li^]. Equally, if G[H^] is fixed, then t and t' are independent random 
variables, with 'law' BiniW,p) (i.e. t = (t^ : w G W) has 'law _Bm(VF,p)' 
if each ty^ is an independent Bernoulli with mean p.) We finish with the 
final claim of j6j, which can be applied directly to our result here, without 
generalisation, and completes the proof. 
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Claim 1. For any e > and d the following holds. If f is a multilinear, 
normal polynomial of degree at most d in n variables, C,{n) = uj{logn), and 
t',t" are independent , each with law Bin{[n],p), then 

Pr(/(t') > max{C(n),n^maxE;/, (1 + e)/(t")}) = n-\ 

Because there are only polynomially many possibilities for Y, x, y, a and b, 
this gives us Lemma \5.11\ 

Proof of claim. Set 

A= — max < Cin) , n'^ max E^ f. 

If E/ < A then Corollary 15.61 gives 

Pr(/(t') > A) =n-'^W; 



otherwise, by Theorem I5.5[ 

Pr{f{t') > (l+6)/(t")) < Pr(max{|/(tO-E/|, |/(t")-E/|} > (6/3)E/) = n-^'\ 

n 



6 Theorems 11.21 and 11.3 



To prove Theorem II .2^ we note that our proof of Theorem ll.2l does not require 
the collapsed graph to be a multigraph. Throughout, we only require that 
the graph does not contain any subgraphs of density m{H), and this follows, 
in our main result, from the vertex collapsing technique, but as in 0, it can 
also follow from strict balance of "H, or equivalently H. Since our partitions 
were fixed only by the partial embedding of subgraphs of density m{H), in 
the strictly balanced case, we can simply choose our vertex partitions freely, 
and then continue with the proof. 

Lastly, to prove fT73| we use Theorem II. 2 [ Partition the vertices of G{n,p) 
as usual into vh equal sets, either as a result of vertex collapsing dense sub- 
graphs, or freely in the strictly balanced case. In the former case, embedding 
the partial factors of dense directed subgraphs, requires a simple modifica- 
tion of Theorem 12.11 which follows the same arguments as the original but 
for directed graphs. A proof of which will appear in y^. We again consider 
the edges between the partitions, only where they correspond to edges in 
H. Now however, we can consider only the edges that are in the direction 
we require, which are distributed uniformly and independently at random, 
but with edge probability p/2, since we discount those edges in the wrong 
direction. At this point, the random digraph is equivalent to 'H{n,p/2), and 
we apply 11.21 directly, providing the directed factor as required. 
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7 Conclusion 

With these results, Conjecture [1] is now proven for both strictly balanced 
graphs, and all non-vertex balanced graphs. The methods used here also can 
be used to prove that the conjecture holds for a wide range of vertex-balanced 
H. 

We can prove a threshold for a variety of 'necklace' and related graphs 
formed of copies of a dense graph linked by a 'supergraph' of equal or lower 
density. 



•■•v^"' 



As an example, in the above case, the threshold for finding a copy of this 
'triangle necklace' must be at least that of a triangle factor, and Conjecture [1] 
suggests it should be equal. We prove this by first embedding a triangle 
factor, and then use 11.21 to embed the dotted edges that form a less dense 
cycle in the supergraph. Provided the resulting graph or multigraph from 
the collapsing process is less dense than m{H), we will always be able to 
apply Theorem 11.21 to find this. 

The supergraph cannot ever be denser than m{H), as this implies a sub- 
graph of density greater than m{H), which is a contradiction. So, all that 
remains to consider are graphs where we will be left with a supergraph of 
equal density, after collapsing all subgraphs. We know this must be strictly 
balanced, or we would have continued to collapse its subgraphs and so we 
can apply Theorem II. 2[ However, the supergraph may have fewer edges 
than each of the collapsed subgraphs, and certainly has fewer than the orig- 
inal graph. This may force it to require a higher threshold, by a constant 
power of log to embed, and so will not prove the conjecture. 



/n 



For the above graph, the conjectured threshold is n ^'^{lognY'^, but us- 
ing the methods within this paper, we are only able to embed it at p = 
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0{n^^^^ (log nY^^), since collapsing the triangle will leave us with another 
triangle, requiring the higher log term to embed. 

In light of this, we see that we can prove Conjecture [1] for all H, except 
for those mentioned above, for which we are still within a constant power of 
log of the conjectured bound. 
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